A multimodal density function estimation approach to formant tracking

نویسندگان

  • Sundar Harshavardhan
  • Chandra Sekhar Seelamantula
  • Thippur V. Sreenivas
چکیده

We address the problem of robust formant tracking in continuous speech. We propose the robust statistical model of t-distribution mixture density (tMM) operating on the “pyknogram” obtained through a multiband AM-FM demodulation technique. The statistical model of the pyknogram is shown to be more-effective to handle the variability in the signal processing stage. The t-mixture density estimation is shown to be more effective than Gaussian mixture density because of outlier data in the pyknogram. For formant tracking, we show that the tMM is better in terms of parameter selection, accuracy, and smoothness of the estimate. We present experimental results on simulated data, real speech sentences, and test the robustness of the proposed MDA-tMM method to additive noise. Comparisons with PRAAT software and a recentlydeveloped adaptive filterbank technique show that the proposed MDAtMM method is superior in several aspects.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Method of Formant Analysis

This paper presents a novel method of formant tracking. It shows that formant information can be extracted from the cepstrum coefficients . Explicit nonlinear formulas has been developed to map psd (power spectral density) of speech signal to formant frequencies. Formants can be tracked by ESPRIT (Estimation of Signal Parameters via Rotational Invariance Techniques). An ”‘constrained”’ ESPRIT s...

متن کامل

Formant Estimation and Tracking Using Deep Learning

Formant frequency estimation and tracking are among the most fundamental problems in speech processing. In the former task the input is a stationary speech segment such as the middle part of a vowel and the goal is to estimate the formant frequencies, whereas in the latter task the input is a series of speech frames and the goal is to track the trajectory of the formant frequencies throughout t...

متن کامل

Hierarchical approach to formant detection and tracking through instantaneous frequency estimation - Electronics Letters

Formant frequencies, represented by major peaks in the spectrum of speech signals, convey important information about speech. The authors propose a method for detecting the formants of voiced speech through ‘instantaneous frequency’ (IF) estimation using a recursive least square (RLS) algorithm. The accuracy of the technique is assessed by comparing it with conventional formant detection techni...

متن کامل

Formant-tracking Linear Prediction Models for Speech Processing in Noisy Enviroments

This paper presents a formant-tracking method for estimation of the time-varying trajectories of a linear prediction (LP) model of speech in noise. The main focus of this work is on the modelling of the non-stationary temporal trajectories of the formants of speech for improved LP model estimation in noise. The proposed approach provides a systematic framework for modelling the inter-frame corr...

متن کامل

Formant-tracking linear prediction models for speech processing in noisy environments

This paper presents a formant-tracking method for estimation of the time-varying trajectories of a linear prediction (LP) model of speech in noise. The main focus of this work is on the modelling of the non-stationary temporal trajectories of the formants of speech for improved LP model estimation in noise. The proposed approach provides a systematic framework for modelling the inter-frame corr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010